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1 Overview 

Let Cn denote the nth Catalan number, which represents (among other 
things) the number of distinct binary trees that have n undistinguished 
nodes and n + 1 undistinguished leaves. (Here the term node shall refer to 
nonterminal nodes only.) Now imagine that in such a tree, one assigns n 
distinct labels to the nodes, and n + 1 distinct labels to the leaves; let Dn 
denote the number of possible trees with the nodes and leaves so labeled. 
Because there are re! ways to label the nodes, and, independently, (re + 1)1 
ways to label the leaves, we see that Dn = Cn ■ re! (re + 1)1, and hence that 
Cn = Dn/{n\{n + 1)!). 

This note presents a bijection between permutations of length 2re and 
binary trees having re labeled nodes and re + 1 labeled leaves. From the 
existence of this bijection one may infer that Dn = (2re)!. The familiar 
formulas for the Catalan numbers follow directly: Cn = (2re)!/(re!(re +1)!) = 
Cn )/T + !)• More generally, the presented bijection maps permutations of 
length dn to labeled re-node d-ary trees, and vice versa. By an argument 
analogous to that for binary case, one may conclude that the number of 
distinct unlabeled d-ary trees with re nodes and m = {d — l)re -|- 1 leaves is 
(dre)!/(re!m!) = ('^)/m. 

Section 2 below illustrates the bijection for the case of binary trees. 
Section 3 gives an algorithm that maps permutations to labeled d-ary trees, 
and Section 4 gives the inverse algorithm. Section 5 gives a more abstract 
characterization of the bijection. Section 6 concludes. 
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2 Tree Construction using Permutations 


Let us begin by considering how one might construct a labeled binary tree 
from tree fragments, using a permutation to guide the construction. 



Figure 1: Tree fragments 

Figure la depicts a leaf labeled 2 and a node labeled 1. Each of these tree 
fragments has an upward sprout, marked “+”, directed toward an unresolved 
parent; the node additionally has left and right downward sprouts, marked 
directed toward unresolved children. We associate with each sprout a 
unit charge of the indicated sign. The depicted leaf thus carries a net charge 
of +1, while the node carries a net charge of —1 (i.e., one positive and two 
negative unit charges). In the text, let denote a leaf with label i, and let 
{u Qi v) denote a node with label i and with left and right subtrees u and v, 
respectively; let ” denote an unresolved subtree. Then Figure la depicts 
£2 and (- 01 -). 

Figure 16 shows the same leaf and node after two of the sprouts have 
been connected by an edge. The resultant structure is the incomplete tree 
{£2 ©1 —)• When two sprouts of opposite sign are connected together, their 
charges annihilate; total charge is thus conserved. In the present instance, 
the leaf and node each acquire a net charge of zero when they are connected. 

In Figure Ic we see the atomic fragments needed to construct an arbi¬ 
trary labeled 2-node binary tree. (We assume without loss of generality that 
a labeled 2-node binary tree has node labels 0 and 1, and leaf labels 2, 3, 
and 4.) The combined charge of all the fragments comes to -|-1; moreover, 
since charge is conserved when fragments are connected, any structure built 
from these fragments must also carry a total charge of -|-1. A consequence 
that we rely on below is that any such structure must contain at least one 
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Figure 2 ; Step-by-step construction of a tree from fragments 


individual node or leaf whose net charge is positive, and hence -|-1 (since no 
other net positive charge can arise at a single node or leaf). As illustrated in 
the figure, to identify the left and right downward sprouts of node i we use 
the numbers 2i and 2i-|-l, respectively. (We do not number upward sprouts.) 
Unlike node and leaf labels, sprout numbers may not be freely reassigned: 
they stand in a fixed relationship to the corresponding node labels. 

Now suppose we wish to use a permutation—say, (3, 2,0,1)—to guide the 
construction of a labeled 2-node binary tree. Figure 2 illustrates the step- 
by-step construction of a tree from the fragments in Figure Ic. Each panel of 
Figure 2 augments the construction with a new edge shown as a dashed line. 
The downward sprouts participating in the new edges are numbered 3, 2, 0, 
and 1, respectively; in other words, we have used the permutation (3, 2, 0,1) 
to decide the order in which the downward sprouts acquire children. The 
child assigned to each new edge is determined as follows: it is a node or 
leaf whose net charge is -|- 1 , and among all such nodes and leaves, it is the 
one whose label is smallest. (As noted above, at least one such node or leaf 
must always exist.) Thus, in Figure 2a the new child is £ 2 , and in Figure 2b 
it is £ 3 . The node labels 0 and 1 are smaller than the leaf labels 2 and 3, 
but it is only after acquiring the children £2 and £3 that one of the nodes— 
node 1—attains a net charge of -|-1. Node 1 then becomes the new child in 
Figure 2c; ^4 finally plays the role of new child in Figure 2d. At the end 
of this construction we have the tree ((^3 ©1 £ 2 ) ©o ^ 4 )- The completed tree 
can be characterized as a finite function from downward sprout numbers to 
child labels, namely {0 1 —> 1,1 1 —> 4,2 1 —> 3, 3 1 -^ 2}, or, equivalently, as a 
vector of child pointers (1,4,3,2). This vector contains every node and leaf 
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label except that of the tree’s root (node 0 in this example). 

Distinct permutations yield distinct labeled trees under this construc¬ 
tion, as we shall establish through the algorithms presented below. Here we 
merely show that the illustrated construction must yield a tree, as opposed 
to a cyclic or disconnected graph of some kind. 

Call a tree or subtree complete if it has no unresolved descendants; call 
a node complete if it is the root of a complete tree or subtree. Leaves are 
trivially complete. Now imagine that the acquisition of a child c by a parent 
p results in a cycle among the tree fragments; this outcome is possible only if 
c was previously an ancestor of p. But since p was previously incomplete (else 
it could not have acquired the child c), its ancestor c was also incomplete. 
Thus, a cycle can arise only when a node acquires an incomplete child. 

At each step of the construction above we required that the new child 
be a node or leaf with net charge -|-1. That requirement implied inductively 
that each such node or leaf was complete: it had no unconnected downward 
sprouts (which would have contributed negative charge), and its children, 
if any, must have themselves been complete when it acquired them. By 
ensuring completeness of new children, the requirement of net charge +1 thus 
precluded cycles. In addition, the presence of positive charge indicated that 
a prospective child did not already have a parent, so no separate bookkeeping 
was needed to make that determination. 

Each time an edge was added to the construction, two previously dis¬ 
connected tree fragments became connected, and the total number of dis¬ 
connected fragments decreased by one. (The two fragments joined by the 
edge could not have been connected previously, because if they had been, 
then connecting them anew would have created a cycle—a possibility we 
have ruled out.) After the addition of four edges, therefore, the five initial 
fragments of Figure Ic necessarily coalesced into a connected acyclic graph. 
The unique node carrying net charge -|-1 at the end of the construction had 
to be the root of a complete tree which, by connectivity, could be counted 
on to contain every node and leaf in the graph. 

3 A General Algorithm 

The construction from Section 2 above extends straightforwardly to d-ary 
trees. Consider a complete d-ary tree with n nodes and m leaves. Each 
node is parent to d edges, giving dn edges altogether. In a construction 
that adds one edge connecting two fragments on each step, the number 
of construction steps must equal the number of edges dn, and the initial 
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' for i = 0 to d*n 

if i < n then let 
if i >= n then let 
. next i 


charge[i] 
charge [i] 


+1 + d*(-l) 
+ 1 


for i = 0 to d*n - 1 


b 


for j = 0 to d*n 

if charge[j] = +1 then exit for 
next j 


( 

■*( 


let k = perm[i] 
let kid[k] = j 

let charge[j] = 0 

let charge [int(k/d)] = charge[int(k/d)] + 1 


next i 


Figure 3; Code for mapping a permutation to a labeled tree 


number of fragments n + m must exceed this value by 1; that is, n + m = 
dn + 1. This constraint on d, n, and m can be seen in terms of charge 
as well: Each of the dn construction steps consumes a positive charge at 
one of the n + m initial upward sprouts, and a negative charge at one of 
the dn initial downward sprouts; when the tree is fully connected, only a 
single positive charge survives at the root. We thus confirm the constraint 
(n + m — dn) — {dn — dn) = +1, oi n + m — dn = +1, with the corollary 
m = {d — l)n + 1. 

Figure 3 gives a True BASIC program that maps permutations of length 
dn to labeled d-ary trees. Assumed given are d, n, and a permutation 
perm[0], ... , perm[dn — 1] of the integers 0,... ,dn — 1. From these inputs 
the program computes a representation of a labeled n-node d-ary tree as a 
vector of child pointers kid[0], ... , kid[dn — 1], where kid[dq + r] gives the 
label of the rth child of node g, for 0 < q < n and 0 < r < d. The integers 
0,... , n — 1 serve as node labels, and n,... ,dn as leaf labels. 

The code segment marked a in Figure 3 initializes the array charge[0], 

... , charge[dn\, which gives, for each i, the net charge of the node or leaf 
labeled i. Each node starts out with net charge 1 — d (reflecting one upward 
and d downward sprouts), and each leaf, with -|-1 (reflecting just an upward 
sprout). As before, we have the invariant Y^^charge[i] = -t-1, which again 
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( 0 , 1 , 2 , 3 ) ^ 
(0,1,3,2) ^ 
(0,2,1,3) ^ 
(0,2,3,1) ^ 
(0,3,1,2) ^ 
(0,3,2,1) ^ 

(1,0,2,3) ^ 
(1,0,3,2) ^ 
(1,2,0,3) ^ 
(1,2,3,0) ^ 
(1,3,0,2) ^ 
(1,3,2,0) ^ 


((a + 6) X c) 
(c X (a + b)) 
(6 X (a + c)) 
(a + (6 X c)) 
((a + c) X 6) 
(a + (c X b)) 

{{b + a) X c) 
(c X (6 + a)) 
(6 X (c + a)) 
{(b X c) + a) 
((c + a) X b) 
((c X 6) + a) 


(2,0,1,3) ^ 
(2,0,3,!) ^ 
(2,1,0,3) ^ 
(2,1,3,0) ^ 
(2,3,0,1) ^ 
(2,3,1,0) ^ 

(3,0,1,2) ^ 
(3,0,2,1) ^ 
(3,1,0,2) ^ 
(3,1,2,0) ^ 
(3,2,0,1) ^ 
(3,2,1,0) ^ 


(a X (6 + c)) 
(6 + (a X c)) 
(a X (c + b)) 
{{a X c) + b) 
((a X b) + c) 
(c + (a X b)) 

{{b + c) X a) 
(6 + (c X a)) 
((c + b) X a) 
((c X a) + b) 
((b X a) +c) 
(c+ (b X a)) 


Figure 4; A correspondence between permutations and labeled binary trees 


assures us that at every step of our construction there will be some node or 
leaf whose net charge is positive. 

The remainder of Figure 3 consists of a loop on i that iteratively performs 
one construction step, as follows: First (in code segment b), it finds the 
smallest label j such that charge[j] = +1; second (in code segment c), 
it makes node or leaf j the child of downward sprout number k, where 
k = perm[i]] and third (in code segment rf), it updates the charges associated 
with node or leaf j and with its new parent. The expression int{k/d) in 
code segment d denotes [fc/dj, which yields the label of the parent node that 
owns downward sprout number k. When the code in Figure 3 completes after 
dn loop iterations, the resultant tree’s root label can be ascertained by once 
again computing [k/d\: in general, this expression identifies a node that has 
just acquired a child, and the node that acquires a child last must be the 
root. 

Figure 4 shows a correspondence between permutations of length 4 and 
labeled 2-node binary trees. This correspondence was obtained by running 
the algorithm of Figure 3 on each possible permutation of (0,1,2,3) with 
d = 2 and n = 2. For readability, the nodes in Figure 4 are designated -|- 
and X, and the leaves, a, b, and c. Observe that a is the left-hand operand 
of -|- in the first six expressions, which correspond to permutations that 
start with 0; in the next six, a is the right-hand operand of -f, reflecting 
permutations that start with 1; and so on. Similar patterns within each 
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group of six expressions reflect the second, third, and fourth components of 
the permutations. 


4 Inverting the Algorithm 

The algorithm of Figure 3 can easily be run “in reverse” to perform the 
inverse mapping from labeled d-ary trees to permutations. That is, given 
d, n, and a representation of a tree in the kid[] array, the algorithm can be 
made to retrace the construction of that tree, and to compute in perm[] the 
permutation that would have caused that particular tree to be constructed. 

To achieve this reversal, one need only replace code segment c in Figure 3 
with code segment c’ from Figure 5. The difference between c and c' is that 
where the former consults perm[] and assigns to kid[], the latter does the 
opposite; it consults kid[] and assigns to perm[]. Neither array is referenced 
anywhere else in the algorithm, so it is immaterial which array is the source 
of information, and which the recipient. The only aspect of code segment 
c or c' that matters to the rest of the algorithm is that (aside from not 
disturbing i, j, oi charge[]) this code segment must furnish ink a. succession 
of distinct values from 0,... , dn — 1. In the case of code segment c, this 
requirement is met in that the values k come from the permutation perm[]. 
In the case of c' , the values k are the indices of the child pointers that point 
to the labels j of all the non-root nodes and leaves of the given tree; since 
each such node or leaf is the target of a unique child pointer, each value 
k in 0,... , dn — 1 will be furnished exactly once. In either case, we have 
perm[i] = k and kid[k] = j after execution of code segment c or c' . Thus, 
the relationship between perm[] and kid[] will be the same whether the 
algorithm is run forward or “in reverse.” 

Code segment b in Figure 3 chose, from among the nodes and leaves 


for k = 0 to d*n - 1 

if kid[k] = j then exit for 
next k 

let permfi] = k 


Figure 5; Code revision for mapping a labeled tree to a permutation 
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carrying charge + 1 , the one with the smallest label j; but that choice was 
somewhat arbitrary. So long as code segment b implements some deter¬ 
ministic policy for choosing a j such that charge[j] = -|- 1 , the algorithm of 
Figure 3 remains invertible through substitution of code segment c' for c. 


5 An Abstract Characterization 


The mappings presented above also admit a more abstract characterization. 
Assume d and n fixed, and let Q = {0,... , n—1} and i? = {0,... , d—1}. For 
q in Q, let 4’g{... , Ur ,...) denote a node labeled q with immediate subtrees 
Ur for r in ii. If d = 2, the notation (pg{uo,ui) is equivalent to {uq Qg ui). 
We define the maxleaf and height of a tree in the obvious way; 

maxleaf ip = p height = 0 

maxleaf {(j)g{... , Ur, ...))= height{(l)g{... , Ur, ...)) = 

max{... , maxleaf Ur ,... } 1 + max{... , height Ur, ■ ■ ■} 

We then define the relation u ^ v to hold on trees u and v just if maxleaf u < 
maxleaf u, or if maxleaf u = maxleaf v and height u < height v. 

Now let t be a d-ary tree with node labels 0,... ,n — 1, and leaf la¬ 
bels n,... , dn. We define the permutation P{t) as follows. First, for k in 
0 ,... , dn — 1 , we define crk{t), or for short, to be the proper subtree v of 
t such that V is the rth child of the node labeled q, where q = [fe/dj and 
r = k — dq. We then take P{t) to be the permutation vr on 0,... , dn — 1 such 
that (Tjr(o) ^ A This permutation is well-defined because the 

proper subtrees of t must be totally ordered under if two such subtrees 
share the same maximum leaf label (or indeed if they share any node or leaf 
at all), they must be of different heights, or else they are the same subtree. 

Conversely, if tt is a permutation on 0,... , dn — 1, we define the tree T^n) 
as follows. First, for q in Q, we define i{q) = 1 -|- max{ 7 r“^(dg -|- r) | r G ii}. 
Since i is injective, we may unambiguously define, for i in 0,... , dn, 


n = 


M- ■ ■ ^T-K-^dq+r),- ■■) if * = 4 ?) for some q^Q, 
^n+i—4t{q&Q I L{q)<i} Otherwise. 


We then take T( 7 r) to be Tdn- 


Example Let d = 2 and n = 2, and suppose the tree t = 4 >o{ 4 >i{i 3 ,i 2 ),i 4 ) 
is given. Its proper subtrees are no = 4 >i{i 3 ,i 2 ), o'l = £ 4 , a 2 = £ 3 , and 
1 T 3 = £ 2 - These subtrees fall in the order <73 -< 02 A no A ni, which 



induces the permutation tt = (3,2,0,1); thus we have P{t) = (3,2,0,1). 
Conversely, suppose the permutation tt = (3,2,0, 1) is given. The indices 
L{q) are then l{0) = l+max{7r“^(0), 7r“^(l)} = l+max{2, 3} = 4, and /.(I) = 
l+max{7r“^(2), 7r“^(3)} = l+max{l, 0} = 2. We then have tq = £ 2 ] ti = £ 3; 
^2 ^1 (2)? "^ 77 ”^(3)) ^3 "^ 4 ? and 74 1^0(^ 7 r~^(0) 5 (1)) 

0o(T2,r3). Hence T(7r) = 0o(T2,r3) = 0o(</>i(4, ^2), ^4)- □ 

We now sketch a proof that P and T are inverses. To begin, suppose 
we apply T to a given tt. In the definition of T, each Ti acquires a distinct 
root label, hence all labels in 0,... ,dn are represented. Moreover, for i in 
0 ,... , dn — 1, the parent of Ti is some tt with i < i' < dn, so by transitivity, 
each Ti is a subtree of T^n = By construction, we also have Tj ^ 

Tj+i for each i, hence tq ^ ^ Tdn- Now suppose we apply P to T{tt). 

The rth child of node q in T{tt) was defined to be r^-i(rfq+r), so from the 
definition of it follows that a(iq+riT{n)) = T^-i(dq+r)i equivalently, we 
have (Tfc(T(7r)) = r^-i(fc) for /c in 0,... ,dn — 1. Letting i = TT~^{k), so that 
7r(z) = k, we obtain iT^(j)(T(7r)) = r* for i in 0,... ,dn — 1, which permits us 
to rewrite tq ^ ... ~< Tdn-i as <7^(0)(r(7r)) ^ ... ^ crnidn-i){T{7r)). Then by 
the dehnition of P, we have P{T{tt)) = n. 

Next consider i = T{P{t)). Let vr = P{t); then P{i) = P{T{P{t))) = 
P{T{it)) = IT also. Let us extend vr with Tr{dn) = dn, and for all t let Cdnit) 
denote t; we then have <77^(0)(i) ^ ^ (^iT{dn)i't) = L and similarly for t. 

We shall show by induction on i that <77^(4) (t) = cr.,^(^i^(t), and hence that 
t = t. If (7jr(j)(t) is a node, it must have the form (j)q{... , adq+rit), ■ ■ ■), 
with adq+r(t) <7^(1)(c for each r in R. By the inductive hypothesis, 
adq+r{t) = o'dq+rit) for each r, hence <pq{-■ ■ 1 f^dq+r{t),...) is also a sub¬ 
tree of t. Alternatively, if cr^(^i-^{t) is just a leaf £j, this leaf must also occur 
as a subtree of t. In either case, we deduce that (7^(i)(t) = <7,^(7') (t) for some 
i! > i. Symmetrically, we have a'.,^(^i)(t) = <77^(7")(t) for some i" > i. It follows 
that ^ ^ and hence that 

We remark without proof that T and P are exactly the mappings of 
Sections 3 and 4. 

6 Conclusion 

We have presented an algorithm for mapping permutations to labeled trees, 
as well as a variant of that algorithm that performs the inverse mapping. 
By establishing that these mappings are bijective, we have shown that each 
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of the factorials in the formulas for the Catalan numbers and their d-ary 
analogues can be understood as a count of permutations. 
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